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Method for decoding a data secjuence encoded with the aid of a binary 
convolutional code 

The present invention relates to a method for decoding a data se- 
quence which consists of K information bits and has been encoded 
with the aid of a binary convolutional code, using a MaxLogMAP algo- 
rithm. 



Voice channels and data channels of a radio communications system, 
which is designed to operate to the GSM/EDGE mobile radio communica- 
tions standard for example, use binary convolutional codes for data 
coding and data decoding. A preferred algorithm for what is known as 
15 "soft input/soft output decoding" is the known "symbol -by- symbol 

log -likelihood maximum a posteriori probability" algorithm (LogMAP 
algorithm) , which is generally implemented with the aid of a maximum 
approximation (MaxLogMAP algorithm) . 

20 The basic MAP algorithm is described, for example, in the publica- 
tion "Optimal Decoding of Linear Codes for Minimizing Symbol Error 
Rate", L.R. Bahl et al., IEEE Transactions on Information Theory, 
pp. 284-287, March 1974. The MaxLogMAP algorithm can be found in the 
publication "Iterative Decoding of Binary Block and Convolutional 

25 Codes", J. Hagenauer et al., IEEE Transactions on Information The- 
ory, vol. 42, no. 2, pp. 429-445, March 1996. 

A window MaxLogMAP algorithm implemented with the aid of what is 
known as a "sliding window" (decoding window) is described in the 
30 publication "An Intuitive Justification and a Simplified Implementa- 
tion of the MAP Decoder for Convolutional Codes", A. J. Viterbi, 
IEEE Journal on Selected Areas in Communications, vol. 16, no. 2, pp. 
260-264, Feb. 1998. 

35 The basis for decoding a data sequence encoded with the aid of a bi- 
nary convolutional code is the binary trellis diagram. One segment 
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of the trellis diagram belonging to one information bit of the data 
sequence detects all possible combinations of m (convolutional code 
memory length) preceded information bits as 2"* initial states. Fur- 
thermore, all ensuing ''conversions" (codings) of the information bit 
5 are detected as 2^""*^^ state transitions, and resulting 2"" target 

states are detected as initial states for the next consecutive in- 
formation bit. Here an information bit sequence corresponds to a 
specific path within the trellis diagram, a sequence of the most 
probable information bits in the trellis diagram being established 
10 with the aid of the MaxLogMAP algorithm. 

When the MaxLogMAP algorithm is implemented, a distinction essen- 
tially has to be made between a systolic implementation and a proc- 
essor-oriented implementation. 

15 

With the systolic implementation the aim is to achieve as high a de- 
gree of parallelism as possible of decoding steps throughout the 
trellis diagram. This implementation is used with extremely high 
data throughput requirements of up to 50 Gbit/s. 

20 

Processor-oriented implementation is suitable for moderate data 
throughput requirements of a few Mbit/s using low-cost hardware. 

For both implementations it is assumed that, for large data se- 
25 quences transmitted block-by-block, decoding can be usefully imple- 
mented only with the aid of a decoding window. 

For reasons of data throughput and hardware cost the window MaxLog- 
MAP algorithm is generally used for decoding. 

30 

Decoding results (soft output values) which are, by way of compari- 
son, obtained using a MaxLogMAP algorithm without a decoding window 
may be more precise, but expensive hardware and memories are re- 
quired for this purpose. 

35 



2002P01088 



PCT/EP03/02942 



An alternative to the MaxLogMAP algorithm, based on a block error 
probability with a given signal-to-noise ratio, is provided by the 
"soft output Viterbi algorithm" (SOVA) described in DE 39 10 739 C3 
and DE 42 24 214 C2 . However, the SOVA algorithm has a smaller cor- 
5 relation between decoding errors and soft output values formed than 
the MaxLogMAP algorithm. 

The object of the present invention is to carry out a decoding of a 
data sequence encoded with the aid of a binary convolutional code, 
10 using a MaxLogMAP algorithm, in such a manner that precise soft out- 
put values are formed as decoding results with inexpensive hardware. 

The object of the invention is achieved by the features of Claim 1. 
Advantageous developments are indicated in the subclaims . 

15 

The method according to the invention is a processor-oriented imple- 
mentation of the MaxLogMAP algorithm. 

As a result of the memory cascading according to the invention on 
20 the one hand and the use of interpolation nodes for metrics calcula- 
tion on the other hand, the MaxLogMAP algorithm can be efficiently 
integrated on exactly one application-specific module (ASIC) . 

The non-use of a decoding window must not entail a loss of accuracy 
25 of the decoding results. 

According to the invention the metrics values are calculated produc- 
tively in a first calculation operation and reproductively in fur- 
ther calculation operations based on the stored interpolation nodes. 

30 

The memory cascading according to the invention enables an optimal 
data throughput to be achieved. 

The method according to the invention saves memory space and there- 
35 fore area on an ASIC module. The area saved is therefore available 
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for further signal processing algorithms, which means that addi- 
tional algorithms can be implemented in the ASIC module. 

An exemplary embodiment of the invention is described below with 
5 reference to a drawing, in which: 

FIG 1 is a diagrammatic representation of a MaxLogMAP algorithm for 
the precision calculation of soft output values according to 
the prior art, 

FIG 2 is a diagrammatic representation of a window MaxLogMAP algo- 
rithm for the calculation of soft output values according to 
the prior art, 

FIG 3 is a diagrammatic representation of a MaxLogMAP algorithm ac- 
cording to the invention for the precision calculation of soft 
output values, and 
FIG 4 is an example of the metrics value calculation and storage ac- 
cording to the invention. 

FIG 1 is a diagrammatic representation of a MaxLogMAP algorithm for 
the precision calculation of soft output values according to the 
prior art. 

The MaxLogMAP algorithm is used to decode a data sequence which con- 
sists of K information bits and has been encoded with the aid of a 
25 binary convolutional code. 

On a trellis diagram TREL, starting at a trellis segment Tl, alpha 
metrics values Ma- calc- store are calculated and stored for every in- 
dividual trellis segment TSN as logarithmic transition probabili- 
30 ties. At the same time, in parallel therewith, starting at a trellis 
segment T2, beta metrics values MP-calc-store are calculated and 
stored for every individual trellis segment TSN. 

The two calculations pass each other at a trellis segment TSM, a de- 
35 cision process being carried out from this time onwards for the pur- 
pose of calculating a soft output value, i.e. an information bit of 
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the data sequence is decoded. When that happens, in the course of a 
"forward decision process" FDP, after the trellis segment TSM has 
been passed, currently calculated alpha metrics values Ma-calc are 
used with the previously calculated and stored beta metrics values 
5 Mp-calc-store to calculate the soft output value. 

This procedure takes place at the same time in a "backward decision 
process" BDP, wherein currently calculated beta metrics values mP- 
calc are used with the previously calculated and stored alpha met- 
10 rics values Ma-calc-store to calculate the soft output value. 



The following reference characters are used below: 



K number of information bits, 

s state of a convolutional code decoding, 

15 m code memory length, 

T required number of trellis segments, 

where T = K + m, and 
71 length of a transient phase, where n > 5*m 



20 With a word width w of a metrics memory and assuming that the re- 
spective metrics values are standardized, two metrics processors and 

a total of 2'K/2'W'2'" memory locations are required for this imple- 
mentation of the MaxLogMAP algorithm. The data throughput achieved 
is as follows: 

25 ^ 7- [Mbit/s] , 

where the above parameter tsegment is dependent on the module technol- 
ogy (ASIC) used to implement the algorithm, on the memory architec- 
ture and on the clock speed used with the ASIC module. 



30 



In the case of terminated codes, each calculation of the metrics 
values starts from the assumption of an uneven probability distribu- 
tion in a trellis segment having an initial state with a probability 
of 100%, while all further states have a probability of 0%. 
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In the case of what are knovm as ''tailbiting codes" no initial state 
is known at the outset. A transient phase of length n therefore has 
to be introduced for both directions. Associated metrics values for 
5 the transient phase are indicated as Ma-pre and MP-pre respectively. 

FIG 2 is a diagrammatic representation of a window MaxLogMAP algo- 
rithm for the calculation of soft output values according to the 
prior art. 

10 

The window MaxLogMAP algorithm, which is^ implemented with the aid of 
a sliding decoding window, is used for long data sequences. The par- 
ticular advantage of the window MaxLogMAP algorithm is its efficient 
implementat ion . 

15 

Similarly to FIG 1, alpha metrics values Ma-calc are calculated pre- 
cisely in a forward direction starting at a trellis segment Tl. By 
contrast, beta metrics values Mf-calc-l of a first decoding window 
DPI and beta metrics values Mp-calc-2 of a second decoding window 
20 DP2 are estimated. 

If the two decoding windows DPI and DP2 reach the right-hand edge of 
the trellis diagram TREL, all the termination information is avail- 
able. The beta metrics values MP-calc-1 and MP-calc-2 are calculated 
25 precisely, and the corresponding soft output values are formed. 

Metrics values Ma-pre, MP-pre-1 and MP-pre-2 are again assigned to a 
transient phase. 

30 The window MaxLogMAP algorithm is described in detail in the above- 
mentioned publication "An Intuitive Justification and a Simplified 
Implementation of the MAP Decoder for Convolutional Codes" . 
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For this implementation, with a data throughput similar to that of 
FIG 1, in total [iv/^"| + l metrics processors and + w-2'" stor- 

age locations are required, 

5 where : 

v|/ is the number of trellis segments for which 2"* metrics are 

stored in each case, 
w is the size of the decoding window in trellis segments, 

and 

10 6 e {0,1} is a parameter which is dependent on the module technol- 
ogy (ASIC) used to implement the algorithm, on the memory 
architecture and on the clock speed used with the ASIC 
module . 



15 Especially where high speed channels have code rates close to one 
(generated with the aid of a points system) , use of the decoding 
window leads to unacceptable deteriorations in performance, however. 

FIG 3 is a diagrammatic representation of a MaxLogMAP algorithm ac- 
20 cording to the invention for the precision calculation of soft out- 
put values. 

Similarly to FIG 1, alpha metrics values are again calculated in a 
forward direction and beta metrics values in a backward direction of 
25 each trellis segment TSN. However, now, in accordance with the in- 
vention, only metrics values of a selected number of trellis seg- 
ments which act as interpolation nodes are selected and stored. 

In a preferred embodiment they are stored in a memory divided into 
3 0 levels in a cascaded manner. 



The following reference characters are used below: 
m code memory length. 
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sm memory length shift of a feed- forward terminated code (in 

the case of a recursively terminated code and in the case 
of a tailbiting code sm = 0) , 

K number of information bits, 

5 T required number of trellis segments, 

where T = K + m; 

71 length of the transient phase where n > 5*m , 

8(1) memory depth of a first memory level SP(1) for interpola- 

tion nodes of a first metrics value calculation, 
10 8(n-l) memory depth of an n-l-th memory level SP(n-l) for inter- 

polation nodes of an n-l-th metrics value calculation, 
and 

5(n) memory depth of an n-th memory level SP(n) for interpola- 

tion nodes of an n-l-th metrics value calculation. 

15 

On the trellis diagram TREL, in a first operation, starting at a 
trellis segment Tl, alpha metrics values Ma-calc(l) are calculated 
in a forward direction FDP and, starting at a trellis segment T2, 
beta metrics values MP-calc(l) are calculated in a backward direc- 
20 tion BDP for each one of the trellis segments TSN as logarithmic 
transition probabilities. 

According to the invention, however, from the calculated metrics 
values Ma-calc(l) and MP-calc(l) of the first operation, metrics 
25 values Ma-calc-sel (1) and MP-calc-sel (1) of the first operation are 
each now filed in a first memory level SP(1) with a memory depth of 
8(1) for a selection of K/8(l) trellis segments acting as interpola- 
tion nodes. 



30 In a second operation, on the basis of each pair of adjacent inter- 
polation nodes of the first operation, metrics values Ma-calc(2) and 
MP-calc(2) are calculated for the trellis segments TSN positioned 
between the respective interpolation nodes of the first operation. 
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As in the first operation, from the calculated metrics values Ma- 
calc(2) and MP-calc(2) of the second operation, for a selection of 
K/8(l)/8{2) trellis segments again acting as interpolation nodes the 
corresponding metrics values Ma-calc-sel (2) and MP-calc-sel (2) of 
5 the second operation are filed in a second memory level SP(2) with a 
memory depth of 8(2) . 

This metrics value calculation based on the interpolation nodes of a 
previous operation is accordingly continued both in a forward direc- 
10 tion and in a backward direction. During this process, after the 

trellis segment TSM is passed, corresponding soft output values are 
formed, memory levels that are released being accordingly reused. 

Here the decision process for the determination of the soft output 
15 values is as described in FIG 1. The individual memory levels are 
arranged in a mutually cascaded structure. 

After n operations all soft output values are determined, the n-th 
memory level having a memory depth of 8(n) with stored metrics val- 
20 ues of K/8(l) /8{2) / . . . ./ 8{n) trellis segments or interpolation 
nodes . 

Comparing this with the data throughput stated in FIG 1 and FIG 2, 
the method according to the invention requires a total of 2'/i met- 

r n \ 



25 rics processors and + 



w-2'" storage locations. 



The following applies: =Ar, e {0,l} , according to the number 

(=1 

of ASIC clock cycles recjuired for one trellis segment and the number 
of ports available on the memories. 

30 

On the basis of existing implementations of the MaxLogMAP window al- 
gorithm, parameters can be deduced and can be used for a comparison 
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between the conventional implementations known from FIG 1 and FIG 2 
and the implementation according to the invention. 
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The results can be found in the following Table: 





Logic 
[Kgates] 


Memory 
[Kbit] 


"Area" 
[Kgates] 


MaxLogMAP 


101 


1080 


2300 


Window MaxLogMAP 


233 


138 


500 


Memory- cascaded 
implementation 


233 


106 


450 



5 With the same data throughput and with no loss of accuracy in the 

MaxLogMAP algorithm as a result of the non-use of a decoding window, 
hardware costs for the memory- cascaded implementation according to 
the invention are reduced by over 80% compared with the conventional 
implementation described in FIG 1. 

10 

The following assumptions were made for this comparison: 



- an overhead resulting from soft input and soft output buffer at a 
systems interface with {R~^ ^'i)- ^ soft ' ^ ' ^memory 

15 where k=K generally applies but k=W+ ( (ng - 1) *5) may apply to im- 

plementation with decoding window; where R is code rate, Wgoft is 
word width of the soft values and amemory is unit area per memory 
bit, 

- an overhead resulting from control module and interface: 

2 0 ^overhead * 

- a metrics processor including scaling of the register bits: 

A^UeM' f parallel + 2^""*^ ' a^^^,^ 

where Aviterw represents the Viterbi arithmetic for a given word 
width w and afupfiop represents the register unit area per bit, fpar- 
25 aiiei is the number of butterfly calculations made in an ASIC cloclc 
cycle and can assume the following values: {l,2, 2^"*"^^}, 

- a metrics word width w = 16, 

- a soft value word width Wgoft = 8, 

- a memory bit area aaemory = 2, 
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- a register bit area ampfiop -10 units, 

- a Viterbi arithmetic/butterfly A^iterbi = 12500 units, 

- an overhead Aoverhead = 50000 units. 

5 The units correspond to a gate equivalent of a 0.18 pm ASIC module 
at a clock frequency of approx. 150 MHz. 

Data from a parameter set relevant to the GSM/EDGE standard are com- 
pared below: 
10 - code rate R = 1/6 

- memory length m = 6 

- block size K = 1000 

- decoder throughput greater than 2 Mbit/s 

- parallelism fparaiiei = 1 



15 
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Configuration 
(® 2.6 Mbit/s) 


"Area" 
[Kgates] 


Exact implementation 


2 metrics processors 


2250 


Window implementation 
(path fusion limit = 
160) 

window also with soft 
input and soft output 
memory 


7 metrics processors 
(1+9) metrics memory 
with 6=32, 0=1 

1 soft input and soft output 
memory with window 


480 


Window implementation 
(path fusion limit = 
160) 


6 metrics processors 
(1 + 0) metrics memory 
with 6=40, 0=1 

1 soft input and soft output 
memory 


500 


Window implementation 
(path fusion limit = 
160) 


5 metrics processors 
(1+9) metrics memory 
with 6=54, 9=0 

1 soft input and soft output 
memory 


425 


Memory- cascaded imple- 
mentation 


6 metrics processors 
2 interpolation nodes memory 
at the l^*' level with 5i=18; 
(1+02) *2 interpolation nodes 
memory at the 2"^ level with 

(1+03) *2 metrics memories with 
63=7, 03=1; 

1 soft input and soft output 
memory 


450 
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Configuration 
(® 2 .6 Mbit/s) 


*^Area" 
[Kgates] 


Memory- cascaded imple- 


6 metrics processors 


400 


mentation 


2 interpolation nodes memories 






at the 1°^ level with 6i=10; 






(1 + 02) *2 interpolation nodes 






memory ac trie z levei wicn 






62=10, 02=0; 






(1 + 83) * 2 metrics proces- 






sors with 63=10, 63=0; 






1 soft input and soft output 






memory 





In the case of UMTS (W-CDMA) and with UTRAN TDD convolutional codes, 
the following parameters are to be used for the convolutional decod- 
5 ing: 

- code rate R = 1/3 

- memory length m = 8 

- maximum block size K = 300 

- decoder throughput greater than 2 Mbit/s 
10 - parallelism fparaiiei = 8 





Configuration 
(® 3.125 Mbit/s) 


^^Area" 
[Kgates] 


Exac t impl emen t at ion 


2 metrics processors 


2891 


Window implementation 
with sliding window on 
soft input / soft output 
memory 


6 metrics processors 
5=40, 0=1 


1848 


Memory- cascaded imple- 
mentation 


4 metrics processors 
5i= 20, 62 = 15 

01 = 0, 02 = 1 


1206 



In the case of UMTS (W-CDMA) and with UTRAN TDD turbo codes a 
slightly expanded MaxLogMAP decoder can be used as part of the 
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turbo-decoding. This means that, here too, memory- cascaded implemen- 
tation can be compared with direct and window implementation: 

- code rate R ~ 1/3 
5 - memory length m = 3 

- maximum block size K = 5200 

- decoder throughput greater than 2 Mhit/s 

- parallelism ^ parallel — 

1/4 





Configuration 
{®3 . 125 Mbit/s) 


^^Area" 
[Kgates] 


Exact implementation 


2 metrics processors 


1727 


Window implementation 
with sliding window also 
for soft input/soft out- 
put memory 


6 metrics processors 
5=40, e=i 


159 


Memory- cascaded imple- 
mentation 


8 metrics processors 
5i=13; 62=10, 63=8, 64=5 
01 = 0, Gi = 1 for 
i = {2,3,4} 


448 



10 

FIG 4 shows an example of metrics value calculation and storage ac- 
cording to the invention. 

15 

In a first operation Dl, alpha metrics values and beta metrics val- 
ues are calculated. For each sixth trellis segment TSN =1, 7, 13, 
73 acting as interpolation point, 2" calculated alpha metrics 
values are stored in a memory level SP(al) and 2"^ calculated beta 
20 metrics values are stored in a memory level SP(pi) . 

In a second operation D2, these metrics values are read out of the 
memory levels SP(al) and SP(pl) and, for the trellis segments posi- 
tioned between the interpolation nodes, the associated alpha metrics 
25 values and beta metrics values are calculated precisely. Relevant 
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trellis segments are again selected as interpolation nodes, and the 
associated metrics values are stored. Two memory levels SP(a2) and 
SP(a2'), and SP(p2)and SP(P2'), respectively, are indicated here by 
way of example. 

5 

In a third operation D3 , the trellis segment TSM is reached from 
both sides, and currently calculated 2"" beta metrics values with 
stored 2^ alpha metrics values filed in the memory SP(a2) are used 
for the backward decision process to determine soft output values. 
10 In exactly the same way, currently calculated 2™ alpha metrics values 
with stored 2^ beta metrics values filed in the memory SP(P2) are 
used for the forward decision process to determine soft output val- 
ues . 

15 After a total of n=3 operations Dn all the decisions values Maecisions 
were formed. 



20 



